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Tests of the universality of free fall and the weak equivalence principle probe the foundations of 
General Relativity. Evidence of a violation may lead to the discovery of a new force. The best 
torsion balance experiments have ruled it out to 10 -1 'b Cold-atom drop tests have reached 10 -7 
and promise to do 7 to 10 orders of magnitude better, on the ground or in space. They are limited 
by the random shot noise, which depends on the number N of atoms in the clouds (as 1 /y/N). As 
mass-dropping experiments in the non-uniform gravitational field of Earth, they are sensitive to the 
initial conditions. Random accelerations due to initial condition errors of the clouds are designed 
to be at the same level as shot noise, so that they can be reduced with the number of drops along 
with it. This sets the requirements for the initial position and velocity spreads of the clouds with 
given N. In the STE-QUEST space mission proposal aiming at 2 • 10 -15 they must be about a 
factor 8 above the limit established by Heisenberg’s uncertainty principle, and the integration time 
required to reduce both errors is 3 years, with a mission duration of 5 years. Instead, offset errors 
at release between position and velocity of different atom clouds are systematic and give rise to a 
systematic effect which mimics a violation. Such systematic offsets must be demonstrated to be as 
small as required in all drops, i.e., they must be kept small by design, and they must be measured. 

For STE-QUEST to meet its goal they must be several orders of magnitude smaller than the size 
-in position and velocity space- of each individual cloud, which in its turn must be at most 8 times 
larger than the uncertainty principle limit. Even if all technical problems are solved and different 
atom clouds are released with negligible systematic errors, still these errors must be measured; and 
Heisenberg’s principle dictates that such measurement lasts as long as the experiment. While shot 
noise is random, hence its reduction becomes apparent as more and more drops are performed, 
the systematic effect due to offset errors at release must be identified through its specific known 
signature, and measured in order to be distinguished with certainty from the signal. This requires 
many well designed measurements to be performed -each to the target precision- for it to be ruled 
out as a source of violation. Ways may be pursued in order to mitigate the limitations identified 
here. 

PACS numbers: 37.25.+k 04.80.Cc 


I. INTRODUCTION 

General Relativity (GR) rests on the experimental fact 
that in a gravitational field all bodies fall with the same 
acceleration regardless of their mass and composition. 
This is known as the “Universality of Free Fall” (UFF), 
and it is also referred to as the “Weak Equivalence Prin¬ 
ciple” (WEP) though it is by no means a Principle of 
physics but rather a fact of nature that all experiments, 
from Galileo till the present time, have confirmed [l] . Ex¬ 
perimental evidence of a violation would result in a sci¬ 
entific revolution, because in such a case either GR must 
be amended or a new force of nature (fifth force) is at 
play. 

UFF/WEP in the field of Earth has been tested with 
macroscopic proof masses of different composition by 
dropping them from a height and by suspending them 
on a torsion balance. The dimensionless parameter 
?7 = A a/a which quantifies a violation is obtained by 
measuring the differential acceleration Aa of the proof 
masses relative to each other (Aa is the physical observ¬ 
able quantity) as they fall with an average acceleration 
a towards the Earth (a is referred to as driving accel¬ 


eration). If UFF/WEP holds, r] = 0; the smaller the 
value of 77 , the more sensitive is the experimental test. In 
the field of Earth the driving acceleration is ~ 9.8 ms -2 
for mass dropping tests and ~ 1.69 • 10 - 2 ms -2 at most 
(at 45° latitude) for proof masses suspended on a torsion 
balance. For the same sensitivity to differential accelera¬ 
tions, mass dropping tests would yield a smaller value of 
r], i.e. a better UFF/WEP test, by almost a factor 600. 

In spite of this big advantage, drop tests have mea¬ 
sured 77 ~ 7 • 10 -10 [2] while slowly rotating torsion bal¬ 
ances [3] have done 4 orders of magnitude better, reach¬ 
ing rj — 10 - , and finding no violation. These figures 
show a higher sensitivity to differential accelerations of 
the torsion balance as compared to mass dropping ap- 
parata by about 4 million times. The parameter 77 is 
named “Eotvos parameter” in honour of the Hungarian 
physicist Roland von Eotvos, who first used the torsion 
balance for testing the universality of free fall. 

If UFF holds the proof masses fall with the same accel¬ 
eration. However, if their centers of mass at initial time 
happen to be located at different heights relative to the 
center of mass of the source body, a classical differential 
acceleration arises due to the fact that the gravitational 
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field is not uniform (there is a non zero gravity gradient, 
or tidal differential acceleration), which mimics a viola¬ 
tion. Drop test [2] is unique (and the best) among drop 
tests in that the proof masses are coupled to one another 
as two halves (made of A1 and Cu respectively) of a sin¬ 
gle vertical disk, as suggested by E. Polacco. During the 
fall the disk is sensitive only to differential accelerations 
between the centers of mass of its two halves. Should 
the Earth attract them differently ( violation ), the disk 
would rotate about the horizontal axis, an effect that 
the authors could measure very precisely by means of a 
modified Michelson laser interferometer in which the two 
arms terminate on two corner-cube reflectors mounted 
on the rim of the disk. Ideally, this is a null experiment: 
no differential effect => no signal. However, it still de¬ 
pends upon release errors (the disk cannot be dropped 
with exactly zero rotation rate), which turn out to be 
the limiting factor, as the authors themselves show |2j. 

In the torsion balance the proof masses are coupled 
and there is a position of equilibrium which depends on 
the design and construction properties of the balance and 
its suspension fiber, not on the initial conditions. At 1 -g 
only forces that act on the proof masses along directions 
which are not parallel to each other (as it would hap¬ 
pen in case of violation) do affect the balance by causing 
a non zero torque along the fiber -which is aligned with 
the local gravitational acceleration g- thus displacing the 
equilibrium position. The Earth’s gravity gradient gives 
a spurious torque along the fiber only by coupling to im¬ 
perfections in the geometry of the balance, which results 
in a difference in the directions of its effect on the proof 
masses [2]. 

At a first glance totally free proof masses appear as the 
best choice to test if they fall with exactly the same ac¬ 
celeration in the field of Earth, both on the ground and 
in space. Since the proof masses must be sensitive to 
extremely small differential forces, the weaker their cou¬ 
pling the better, and one may be tempted to push weak 
coupling to the limit of no coupling at all (i.e., totally 
free masses). For macroscopic masses this issue has been 
thoroughly investigated by many authors E5H2], includ¬ 
ing J. P. Blaser [5], who was the leading scientist of the 
STEP (Satellite Test of the Equivalence Principle) stud¬ 
ies when this proposal was selected by ESA (European 
Space Agnecy) as a medium size candidate mission for 
two times, first in collaboration with NASA and then as 
an ESA only mission cm All studies ISHSI have demon¬ 
strated beyond question that drop tests are not at all the 
best choice, on the ground as well as in space, because 
the initial condition errors in combination with gravity 
gradient result in a large systematic effect which severely 
limits these experiments. 

As shown in 0, a nice favourable case is that of the 
Moon and the Earth freely falling in the field of the Sun 
(with an average acceleration g© ~ 6 • 10 -3 ms -2 ), the 
Moon’s orbit being measured by laser ranging to cor¬ 
ner cube reflectors left by astronauts on its surface. In 
this case the gravity gradient from the Sun is very small 


thanks to its very large distance (d©© — 1.5 ■ 10 11 m) 
yielding 7© - Iff - 1-3 • 10 _ 11 5 Q /m. Thus, laser 
ranging with centimeter precision has been able to reach 
10~ 13 (HI](n3) (compatible with ^|*10 _2 — 1-3 -lO” 13 ). 
One order of magnitude improvement is expected with 
the APOLLO laser ranging system at millimiter level m 
once the physical model has been improved accordingly. 
Beyond that, it will be extremely hard to overcome the 
effect of gravity gradient and initial condition errors. 

In recent years, several tests of UFF/WEP have been 
performed by dropping atoms in light pulse atom inter¬ 
ferometers [T2HT6] . They have reached r) ~ 1(D', a fac¬ 
tor 140 worse than drop test (2; and 6 orders of magni¬ 
tude worse than the torsion balance test [3j, but scientists 
promise to do many orders of magnitude better, on the 
ground m or in space [T8II2 1 1 . An additional cold atom 
test, also to g ~ 10“', has been published in 2014 [22] but 
it is not considered in this work because it is not based on 
a mass dropping approach, hence initial condition errors 
as discussed here do not apply to it. 

The effect of initial condition errors as studied so 
far (5H2 was concerned only with macroscopic proof 
masses. In this work we revisit the issue to include atoms 
dropped in light pulse atom interferometers, motivated 
by the fact that the number of atoms at detection is very 
small compared to Avogadro’s number, and therefore the 
extremely small mass of the falling bodies plays a key 
role when considering limitations on position and veloc¬ 
ity errors imposed by Heisengerg’s position-momentum 
uncertainty principle. 

The paper is organized as follows. 

Sec. [Til recalls the basic mathematical formulae for the 
effect of initial condition errors in the measurement of the 
Earth’s gravitational acceleration with atom interferom¬ 
eters. Sec. |III| analyzes the random and systematic effects 
of initial condition errors in cold-atom drop tests of the 
universality of free fall. Sec. [IV] shows how Heisenberg’s 
principle limits such tests (due to the small number of 
atoms if compared to Avogadro’s number) and points 
out the difference in dealing with initial condition er¬ 
rors versus shot noise, because the latter is random while 
the former give rise also to a systematic differential ac¬ 
celeration which mimics the signal and must therefore 
be distinguished from it. In the last section we briefly 
summarize the results and conclude that at present cold- 
atom drop tests of UFF/WEP are not competitive with 
results achieved on the ground by torsion balances and 
with goals pursued in space, all using macroscopic proof 
masses and none being based on a mass dropping ap¬ 
proach. We also offer some indications as to how the 
limitations due to initial condition errors outlined in this 
work can be mitigated (or avoided) in order to improve 
the present level of UFF/WEP tests with cold atoms. 
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II. EFFECTS OF EARTH’S GRADIENT AND 
INITIAL CONDITION ERRORS 

In 1995, by monitoring the motion of a freely falling 
corner-cube retroreflector with a laser interferometer sci¬ 
entists were able to measure the absolute value of the lo¬ 
cal gravitational acceleration g to A g/ g ~ 1.1 ■ 10~ 9 [23]. 

In 1999 the absolute value of g was measured to 
A g/g ~ 3 • 10~ 9 by dropping caesium atoms in a light 
pulse atom interferometer [24, 25]. In this case the key 
optical elements of the interferometer (beam splitters and 
mirrors) are implemented by using stimulated Raman 
transitions between atomic hyperfme groundstates (see 
[26] for details). The atomic wave packet is split, redi¬ 
rected and finally recombined via atom-light interactions. 
The phase that the atoms acquire during the interferom¬ 
eter sequence is proportional to the gravitational acceler¬ 
ation g that they are subjected to. At present shot noise 
is the limiting factor, and it is proportional to 1/ y/N, 
with N the number of atoms at detection. Being ran¬ 
dom, shot noise is expected to decrease as l/y/n, with 
n the number of drops. In a well designed experiment 
any other noise source that needs to be reduced as 1/y/n 
should not exceed the shot noise limit. If so, the number 
of drops required to reduce shot noise to the target pre¬ 
cision will bring all other noise sources below the target 
too. 

The gravity gradient of Earth, combined with the ini¬ 
tial position and velocity of the falling atoms, gives rise to 
a systematic spurious effect on their acceleration (hence 
on the measured phase shift) which cannot be neglected 
if one aims at measuring g to about 10“ 9 . For atoms 
falling in an atom interferometer this effect has been cal¬ 
culated by [2SHZ3 following the tutorial [28]. Only the 
contribution to first order in the gravity gradient 7 is 
relevant and in [ 23 ] it has been reported to be: 

Ag = 7 (it 9t2 ~~ VoT ~ z °) W 

where T is the time interval between successive light 
pulses (up to 160 ms in this experiment), 7 ~ 3-10 -7 g/ m 
is the gravity gradient in the laboratory, z 0 and v 0 are 
the initial position and velocity of the atom. Note that 
because of a misprint, in [24] the second term on the right 
hand side of 0 reads — 7U0 , while it should be multiplied 
by T. 

For a free falling point mass (including one single atom) 
whose initial conditions - nominally zero- have errors 
A z 0 , Av a (in the direction to the center of mass of Earth) 
the first order tidal acceleration at the height of fall z(t) 
is: 

A g(t) = - 2 ^^-z(t) = 7 - Av 0 1 - Az 0 ^ (2) 

with M 0 , 1?0 the mass and radius of Earth, G the uni¬ 
versal constant of gravity and: 

7 = 5 ^-- 3.14 -10”V m ( 3 ) 


the gravity gradient of Earth whose numerical value is as 
given by [24] and reported in ([I]). 

It is interesting to note that the disturbing acceleration 
0 computed for atoms falling in the atom interferometer 
differs from 0 in the coefficient of the quadratic term 
by j^jgT 2 . As pointed out by [25], this discrepancy is 
due to the fact that in the atom interferometer the accel¬ 
eration of the atoms is measured as a second difference 
of their positions at times 0, T and 2 T when -during 
their ballistic flight- they are subjected to light pulses 
(see Sec. 2.1.3 of Peters’ PhD thesis [25]). If we take ([ 2 ]) 
and integrate twice in order to get the position, we ob¬ 
tain three position terms proportional to t 4 , t 3 and t 2 
respectively. We then compute the acceleration by defin¬ 
ing it as the second position difference at times 0, T and 
2 T. We find that for the t 2 and t 3 terms the second po¬ 
sition difference is the same as the corresponding second 
time derivative, while this is not so for the f 4 position 
term. In this case it is easy to see that the second po¬ 
sition difference yields ^7 gT 2 instead of I 7 gT 2 , with 
an acceleration difference by j^^/gT 2 ED]- It is apparent 
that this difference has nothing to do with the 11 quan¬ 
tum mechanics” versus 11 classical mechanics” approach. 
As stated in J25] (Sec. 2.1.3): ... this type of measure¬ 
ment is not intrinsically “quantum mechanical’'. ...We 
can simply ignore the quantum nature of the atom and 
model it as a classical point particle that carries an in¬ 
ternal clock and can measure the local phase of the light 
field. 

The acceleration difference discussed above does not 
affect cold-atom drop tests of the universality of free fall, 
because by taking the difference of the free fall accelera¬ 
tions of two different atom species or isotopes -which is 
the physical quantity to be measured when testing UFF- 
the term quadratic with time in the acceleration 0 can¬ 
cels out. It does not cancel out in experiments to mea¬ 
sure the absolute value of g , such as [ 23 ], in which case 
the systematic effect Q due to initial condition errors in 
combination with the Earth’s gradient had to be consid¬ 
erably reduced in order to measure the absolute value of g 
to 3 • 10 -9 . The authors report a careful systematic error 
analysis that required to perform many measurements of 
g at different heights. By fitting the measurement data 
to the predicted curve the gradient error could be identi¬ 
fied, measured and, to that extent, subtracted (as shown 
by the authors in their table of systematic effects). 

Cold-atom drop tests of the universality of free fall 
have been proposed and investigated by the European 
Space Agency EMU to be performed in space at low 
Earth altitude. In these proposals two overlapped clouds 
of different isotopes fall in a Dual-Isotope-Interferometer 
(DII). The free fall acceleration is measured simultane¬ 
ously for each cloud. By computing their difference, the 
acceleration of interest A g = r)g(h) is derived. 

In absence of weight the leading term measured for 
each free falling atom cloud is the inertial acceleration 
arising because of non-gravitational forces acting on the 
outer surface of the spacecraft. This inertial acceler- 
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ation is huge compared to the target, but common to 
both clouds, and therefore -if the instrument is properly 
designed it can be rejected. If not, it must be com¬ 
pensated by drag-free control of the spacecraft, which re¬ 
quires a proof mass unaffected by non gravitational forces 
(to the level of drag compensation), a sensor to measure 
its motion relative to the spacecraft, and thrusters (with 
the necessary amount of propellant) to make the space¬ 
craft follow the proof mass. With 85 Rb, 8 'Rb a rejec¬ 
tion factor of 4 • I 0 8 is postulated [ 2 T] . At present the 
best measured rejection factors are 550 for 85 Rb, 8 'Rb 
(Ref. [H]) and 303 for 87 Rb and 39 K (Ref. [HTjl. showing 
that an improvement by about 6 orders of magnitude is 
needed to meet the requirement. 

We mention for completeness that on the ground in 
addition to the gradient ([3| there is also a gradient of 
the centrifugal acceleration. It is due to the Earth’s daily 
rotation at angular velocity Uq, and it adds a factor < 
~ 5.4- 10 _1 ° gj m, which in the g measurement |2Tl251 
is negligible. 

At low Earth altitude h the gravity gradient is: 

7 «” = (JSTft) s[h),m (4) 

g(h) being the Earth’s gravitational acceleration at alti¬ 
tude h. Unless the spacecraft attitude is fixed in inertial 
space the centrifugal gradient must be added too, which 
is 1/2 of the gravity gradient ©• 


III. RANDOM AND SYSTEMATIC INITIAL 
CONDITION ERRORS IN COLD-ATOM DROP 
TESTS OF UFF 

From now on we consider the effect of Initial Condition 
Errors (ICE) in cold-atom drop tests of UFF/WEP, and 
neglect the term quadratic in time because in the differ¬ 
ential acceleration of two clouds dropped simultaneously 
it cancels out. 

Let us start with one single atom freely falling in the 
presence of the Earth’s gradient 7 . If it has been released 
at time t = 0 with ICE A z 0 , Av 0 (in modulus) the result¬ 
ing error at time t in its measured free fall acceleration 
is: 


A 9(t) ICB _ singleatom = 7 ( Az ° + Avot ) ( 5 ) 

where 7 is 0 on the ground and © in space. 

If N atoms with these ICE are released together, ran¬ 
dom velocities abate with y/N, and random position er¬ 
rors are yfN smaller too, hence the error in the acceler¬ 
ation measured at time t is: 


A ^)/ CE - 5i „ 


■ < e > 


In the case of a DII, in which the free fall accelerations of 
two atom clouds are measured independently, each one 


with a random error ([h]), the random error in their accel¬ 
eration relative to each other ( differential ) is a factor y/2 
times larger. 

Due to the random nature of noise | 6 |, it can be re¬ 
duced by performing many drops, as long as they are un¬ 
correlated. With n such drops the sigmas of the center of 
mass position and velocity at initial time, i.e. Az a /yfN 
and Av 0 /VN will further decrease as 1 / y/n. 

In a DII there is also a contribution to the differen¬ 
tial acceleration due to position and velocity offset errors 
Azo-rei, Avo-rei (hr the direction to the center of mass 
of the Earth) between the center of mass position and ve¬ 
locity of the two clouds at release relative to each other. 
They arise because of inevitable differences in trapping 
and releasing different isotopes/species -and are there¬ 
fore systematic- yielding a systematic differential accel¬ 
eration: 

A 9 ( K t ) ICE -a,Ss,ts = li^Zo-vel + Avo-rel^j (7) 
which mimics a violation. 

Note that an error in the component of the relative 
velocity along the orbit does also result in a relative po¬ 
sition error in the radial direction (i.e. the direction of 
a violation signal in the field of the Earth) because dif¬ 
ferent along track velocities give rise to different orbital 
radii, the orbital velocity being inversely proportional to 
the square root of the orbital radius. 

It is mandatory to demonstrate that the measured sig¬ 
nal is not due to the systematic error 0. 

A known solution adopted in drop tests with macro¬ 
scopic proof masses consists in dropping masses of the 
same composition using the same apparatus and perform¬ 
ing an experiment as similar as possible to the real one: 
since in this case there must be no violation, the sensi¬ 
tivity measured sets the level of the UFF test that can 
be claimed with this experiment. This check has been 
done very rigorously in the case of drop test [ 2 ], and it 
led to establishing that UFF could not be tested better 
than ~ 7 • 10 _1 ° because errors in releasing the vertical 
disk could not be reduced below this level. A null test 
of this type cannot be done with free identical atoms 
as test masses, because identical atoms cannot be dis¬ 
tinguished. As suggested by [32], one should make the 
two atom clouds slightly different (e.g. by dropping the 
same atom in different metastable states), with a differ¬ 
ence that allows them to be distinguished in the atom 
interferometry measurement, but that is negligible for 
the sought for UFF violation signal. 

An effective alternative when the signature of a sys¬ 
tematic effect is known as in this case, is to perform a 
number of measurements -each to the target precision- 
in appropriately modified experimental conditions such 
that the systematic effect can be separated out based 
on its known dependence on the physical parameters in¬ 
volved. In so doing the systematic effect is separated 
out and measured, so that its contribution to the signal 
of interest can be firmly identified and possibly reduced 
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below the target. Such a careful analysis of systematics 
(e.g. as reported by [24] for the absolute measurement of 
g) obviously requires the integration time needed to com¬ 
plete one single measurement (by reducing random errors 
below the target) to be short enough so that systematics 
can be separated from the signal in a realistic time span. 
This is the case of the torsion balance experiments. 

So far cold-atom drop tests of the universality of free 
fall have been performed on the ground Q 2 HH] reaching 
77 ~ lir 7 , a factor 140 worse than drop test (2] and 6 or¬ 
ders of magnitude worse than the torsion balance test [3] . 
With 7 ~ 3.14 • 10 ” 7 < 7 /m, the effect of initial condition 
errors is not a limitation at this level. 

A cold-atom drop test has been proposed in 2007 m 
aiming initially at 77 = 10 15 and eventually at 10 17 , to 
be performed inside a 10 m-tall vacuum chamber built at 
Stanford University. They have imaged single clouds [33] 
of 87 Rb atoms with TV = 4 • 10 6 atoms, 200 pm initial 
radius, 2mm/s initial velocity spread, T = 1.15 s and a 
reported shot noise limit A g sn ~ 4 • 10 ” 12 g. With these 
values the contribution from ICE to the acceleration er¬ 
ror § is reported to produce a phase shift of 0.18 rad 
(Table 1, term 5 in |J3]) which, if compared to the phase 
shift of 2 . 1 - 10 8 rad produced by the leading g term, yields 
A g ~ 8 • 10“ 10 g. The authors are aware that the mea¬ 
surement is limited by seismic noise. Nevertheless, by 
comparing various portions of the imaged cloud and ex¬ 
tracting correlated phase noise over many runs they suc¬ 
ceed in reducing the phase shift noise by ~ 100 , thus 
inferring an acceleration sensitivity A g — 6.7 • 10 ~ 12 g, 
close to the shot noise limit. 

The “Space Time Explorer and Quantum Equivalence 
Principle Space Test” (STE-QUEST) proposal studied by 
ESA as candidate to a medium size mission mmm 
aims at a UFF test to 77 = 2 • 10 ” 15 by dropping atoms 
in a dual isotope 85 Rb, 8 'Rb interferometer in low Earth 
orbit at h ~ 700 km altitude (where g{h) ~ 8 ms ” 2 and 
7 (h) ~ 2.83 • 10 ” 7 g(h)/m). It is expected to be limited 
by a random shot noise differential acceleration A g sn ~ 
3.66 • 10 ~ 13 g(h) defined as (see |2H P-11): 

Ag sn = V2— -j= (g) 

~ 2.93 • 10 ” 12 ms ” 2 ~ 3.66 ■ 10 ” 13 g(h) 

where A = 780 nm is the laser wavelength, k = ^ ex¬ 
ploits the technique for enhancing the area of the inter¬ 
ferometer, T = 5 s is the free evolution time (time inter¬ 
val between subsequent light pulses and C = 0.6 is the 
contrast. 

By performing n = 1.48-10 5 drops, uncorrelated and in 
the same experimental conditions, the random shot noise 
([ 8 j can be reduced to 9.5 • 10 ~ 16 g(h) which is a factor 
2.1 below the target violation signal 2 • 10 ” 15 g(h). In the 
experiment design outlined in 211 -20 s repetition time, 
0.5 hr out of 16 hr dedicated to the experiment at perigee- 
one measurement with this number of drops requires 3 
years to be completed, within a total mission duration of 
5 years. 


For the random acceleration error ([6]) not to exceed the 
shot noise limit ([8]) the atom clouds are required to have 
TV=10 6 atoms at detection with 

300 pm initial radius , 82 77 ms ” 1 initial velocity spread 

(9) 


IV. HEISENBERG’S PRINCIPLE AND 
INTEGRATION TIME; SHOT NOISE VS 
SYSTEMATIC RELEASE ERRORS 


Each test mass, as well as a single atom, must obey 
Heisenberg’s uncertainty Principle (HP), which states: 

Ap 0 ■ Az 0 > | (10) 

where h = 1.054 • 10” 34 Js is the reduced Planck constant 
and the linear momentum A p 0 contains the mass of the 
body. For a single atom, because of its extremely small 
mass, HP requires (in the case of Rb): 


(Av 0 ■ Az 0 ) HP _ atorn > 

£ rriRb 


h 

> - 


1 


2 85.468 • 10” 3 ‘ Na m / S 
> 3.7 • 10” 10 m 2 /s 


( 11 ) 


where m R b = ^ 777 — kg is the mass of a Rb atom and 
N A = 6.022 • 10 23 is Avogadro’s number. 

For each cloud made of 7V=10 6 Rb atoms released to¬ 
gether, the random errors on the initial center of mass 
velocity and position are reduced by y/N. This is equiv¬ 
alent to a free mass with position error (y/NAz 0 ) and 
momentum error (m R bVNAv 0 ), for which HP requires 
(Nm Rb Av 0 ■ Az 0 ) HP _ freemass > h/2, hence: 

/AAA ^ 1 Na 2 / 

( Vo ■ HP—freemass — 2 85.468 • 10 ” 3 ' ~N~ m ^ 

> 3.7 • 10 ” 16 m 2 /s . 

( 12 ) 


This is the ultimate limit, since it is the HP limit for a 
single, free Bose-Einstein-Condensate of TV atoms, and 
as such a lower limit to the initial conditions of the real 
experiment (free atoms released from an optical trap). 

In order to quantitatively assess the implications of 
Heisenberg’s uncertainty principle in cold-atom drop 
tests of UFF we refer to the STE-QUEST proposed space 
experiment because it has been investigated within ESA 
for several years and literature is available that pro¬ 
vides all the information needed for quantitative assess¬ 
ment mmm- A similar experiment was investigated 
by ESA (with participation by the author) to be per¬ 
formed on the International Space Station [19] . However, 
the target was less ambitious than for STE-QUEST, and 
the ESA report of that study is an unpublished draft. 
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STE-QUEST aims at rj = 2 • 10 —15 , a target that would 
improve the current best tests performed with macro¬ 
scopic masses by a factor of 50, which makes the proposal 
worth considering despite the huge gap (by a factor of 50 
million) which separates it from the level that atom in¬ 
terferometers have achieved so far mmm- 

In the STE-QUEST atom clouds ([9]) each atom obeys 
the HP limit 0 by a factor 66, that is the product of 
its position and velocity errors is above the uncertainty 
limit by a factor 66. Their centers of mass have position 
and velocity errors smaller by a factor \J~N = 10 3 , hence 
they are above the HP limit (12) by the same factor 66. 


With a comparable share of error in position and velocity 
it means that each error is roughly a factor \/66 — 8 
above its HP limited value. By comparison, position and 
velocity errors of the single clouds realized by [33] are a 
factor \/l.l • 10 3 ~ 33 above the HP limit. 

In every drop the contribution Q of random ICE to 
the acceleration difference between the clouds amounts, 
with the planned initial condition errors to \/2 ■ 
A 9 IOB _ einglecloud ~ 2 - 84 ' iO -13 g(h), which is slightly 
smaller, by a factor 1.3, than the expected shot noise 
limit ([8|. Thus, the same number of drops that need to 
be performed in order to reduce the shot noise a factor 
2.1 below the target signal will reduce (also as 1 /y/n) 
the initial random errors hence the random differential 
acceleration error V2 • An „ , , - to a value 2.7 

times smaller than the signal. 

With the same number of atoms, were it possible to 
run the experiment with position and velocity errors at 
exactly the limit of Heisenberg’s uncertainty principle, 
they would result in a random differential acceleration 
error a factor \/66 ~ 8 smaller, of 3.5 ■ 10 -14 g(h ), which 
is the lowest possible limit and a factor 10 below the 
expected shot noise 

Let us now consider the systematic error 0- 

In the list of STE-QUEST systematic errors the re¬ 
quirements set by the proposers for the offset errors 
Az a _ re i, Av 0 _ re i at release are (see [21], Table 4, first 
entry): 

Azo-rei = 1-1 nm Av 0 — re i = 0.31 nm/s . (13) 


First of all, it is interesting to compare these require¬ 
ments with the position offsets between the centers of 
mass as required in UFF/WEP experiments using macro¬ 
scopic proof masses. 

It was pointed out by (± that GG (3T| is the only 
proposed space experiment in which the test masses are 
coupled and motion occurs around a position of rela¬ 
tive equilibrium independent from initial conditions (as 
in the case of the torsion balance). In GG the proof 
masses are coaxial cylinders rotating around the symme¬ 
try axis, weakly coupled in 2D (the plane perpendicular 
to the spin/symmetry axis) whose physical property of 
self-centering (starting from construction/mounting off¬ 
sets of 10 /im) makes the gradient effect compatible with 
a test 200 times more sensitive than STE-QUEST, to 
77 = 10 -17 . In Microscope (35], to be launched in April 


2016, the coaxial test cylinders are sensitive along the 
symmetry axis while rotation occurs along an axis per¬ 
pendicular to it; the position offsets required by con¬ 
struction/mounting amount to 20 /im, to be reduced to 
0.1 /im by offline data analysis (over various measure¬ 
ments) based on the specific, known signature of the 
gradient effect so as to bring it below a target which is 
two times more sensitive than that of STE-QUEST, i.e. 


Tj = 10" 15 . 

In STE-QUEST, if the requirements (131 are met in 
every drop (though they don’t need to be measured to 
this level in every drop) the differential acceleration 0 
is a factor 2.7 below the target signal. 

It is crucially important to verify that the offsets be¬ 
tween different atom clouds at release meet the require¬ 
ments (13), and that they do meet them in all drops for 
the entire duration of the experiment. Assume that a 
fraction / < 1 of the required number of drops n has ini¬ 
tial offset errors that are larger than required by (13), to 
the extent that the resulting differential acceleration er¬ 
ror (J7| is k > 1 times larger than in the remaining (1 —f)n 
drops which meet (13). Then, the resulting average error 
in the differential acceleration is: 


2- = /n.tA., + (l-/)„.A g = m _ 1} + 1]i9 

(14) 

For instance, if a fraction / = 10% of the drops have off¬ 
sets that produce acceleration errors k = 21 times larger 
than those produced in the remaining 90% of the drops in 
which (13) are met, the resulting systematic error ([T]) will 
be 3 times bigger, which is larger than the target signal 
and indistinguishable from it with a single measurement. 

The size of the atom clouds at initial time (in position 
and velocity space) is the size of the trapped clouds. It 
is limited by HP 0 (see [36]), and the centers of mass 
position and velocity are limited by HP (12). 

When confined, clouds of different isotopes/species 
have different size, because of the different physical prop¬ 
erties of the atoms, including mass. In STE-QUEST 
the mass difference alone results in a size difference of 
3 fj, m. In current instruments, estimates of the offset 
errors at release are many orders of magnitude away 
from the requirements (13). In the UFF test |L5 to 
?y ~ 1(T 7 the estimates reported are A z 0 - re i = ±0.2 mm 
and Avo—rei A 6mm/s, which are abo ut 1.8 • 10 5 and 
1.7 • 10' times larger, respectively, than (13). 

Nevertheless, there is no theoretical limit to the accu¬ 
racy with which the centers of the two clouds can be made 
coincident. In principle, with enough care in preparing 
and releasing the trap, the offsets can be made to meet 
(131. However, such preparation requires validation and 
only repeated measurements in the actually realized trap 
can provide it. In order to be excluded as the cause of 
any anomalous acceleration found in the experiment (vi¬ 
olation?) the initial offsets must be measured, and the 
measurement is limited by HP (12), namely by the un¬ 
certainty limit in position and velocity of the center of 
mass of each cloud. 
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On the other hand, the required initial offsets (13) are 
well below this limit. Because of the extremely small 
mass of the clouds (even 10 6 atoms are very few com¬ 
pared to Avogadro’s number) they are below the HP limit 
(12) by a factor \/l.l • 10 3 ~ 33. Thus, the uncertainty 
principle prohibits the initial offsets to be measured to 
the required precision in a small number of drops. 

Only by measuring them for the entire integration time 
of the UFF test, and by averaging over as many drops 
as required for the test, they can be proven to meet the 
requirements. 

Should STE-QUEST aim at testing UFF only two 
times better than the present goal, tor] = 10 ~ 15 , it would 
require an integration time 4 times longer, of 12 years, 
to complete one single measurement and to measure the 
initial offsets to the level required! In addition, the ini¬ 
tial size and velocity spread of each cloud would have to 
be only a factor 4 above the HP limit. These facts ex¬ 
plain why the target of STE-QUEST could not possibly 
be pushed to 10” 15 in order to make it competitive with 
Microscope. 

Assuming that all technical problems are solved, and 
the initial offsets are negligibly small, still they must be 
measured and the integration time needed (for a given 
target) to reach the precision required is dictated by HP 
limit (12). This is a fundamental limit and can be relaxed 
only by increasing the number N of atoms in the clouds 
(without increasing position and velocity errors in their 
centers of mass), which would as well reduce the shot 
noise limit. 

Should the target of STE-QUEST be one order of mag¬ 
nitude less ambitious, to 2 • 10 -14 , with the same shot 
noise, then initial condition errors could be 10 times 
larger, hence the clouds would be a much safer factor 
\/665 ~ 82 above Heisenberg’s limit and the integration 
time would be a factor 100 shorter, requiring 11 days. 
Offset errors at release could also be 10 times larger. 
They would still be below the HP limit for each cloud 
(but only slightly, by a factor y/ll ~ 3.3), and their 
measurement would require the same integration time, 
but this would now be realistic and leave enough time 
for checking their systematic effect. At this level limita¬ 
tions would not be fundamental but mostly technical, as 
it is inevitable given the 5 million gap from the current 
state of the art. However, this goal would be 20 times less 
sensitive than Microscope’s goal and only 5 times better 
than the current best tests of of UFF/WEP, making the 
case for an expensive space mission extremely weak. 

In its current design STE-QUEST proposes to mea¬ 
sure the offsets at apogee (while drops to test UFF are 
performed at perigee), by producing atom clouds and 
imaging them in order to verify, based on their evolu¬ 
tion, how far apart they were at release (see [21], P-12). 
For this approach to work, one should demonstrate that 
systematic errors are the same in both cases, and that the 
accuracy of the imaging method is close to Heisenberg’s 
limit. 

The key difference between random and systematic er¬ 


rors must be kept clearly in mind. Once a random error 
has been reduced to a certain level the result is appar¬ 
ent, and if it has reached the design level the measure¬ 
ment is over. Instead, a systematic effect which is known 
to mimic the signal requires a number of different mea¬ 
surements, each of them to the target level, in order to 
verify its specific signature, i.e., the way it depends on 
some physical parameters, for it to be distinguished from 
the signal beyond doubts. It is well known that a very 
long integration time rules out the possibility of a careful 
check of systematic errors and questions the significance 
of an experimental result. 

It has been suggested to cancel the Earth’s gradient by 
placing a mass nearby. A reduced value of 7 would reduce 
the systematic effect 0 and possibly make it irrelevant, 
at least when aiming at r] = 2-10~ 15 . Inside the small ex¬ 
perimental region in which atoms are dropped the Earth’s 
gradient is almost constant, but time varying depending 
on the orbital motion and the attitude of the spacecraft. 
Instead, the mass must be fixed (to avoid bigger prob¬ 
lems), and very close by (at 50 cm more than 2 tons would 
be required). Hence, its gradient changes across the re¬ 
gion but it is constant in time. As a result, the Earth’s 
gradient would be either under compensated or over com¬ 
pensated, thus not solving the problem. On the ground 
this difficulty may be reduced because the Earth’s gra¬ 
dient in the experimental region does not change with 
time (the lab doesn’t move), and a large mass could be 
placed far away. However, with the atoms falling at 1-g 
the experimental chamber inside which the Earth’s gra¬ 
dient must be compensated is certainly larger (as in the 
10m evacuated tower at Stanford). 

Another possibility may be worth investigating. In¬ 
stead of using position and momentum as conjugate vari¬ 
ables (subject to the uncertainty relation ( 10 )) one may 
try to combine them in one single variable such that its 
error can be minimized at the expense of the error in its 
conjugate. This technique (known as squeezing) has been 
recently applied to shot noise in atomic clock measure¬ 
ments m with a reduction equivalent to increasing the 
number of atoms by a factor 100 , i.e. equivalent to reduc¬ 
ing the phase measurement noise, hence the acceleration 
shot noise, by a factor 10 . 


V. CONCLUSIONS 

In experiments to test the universality of free fall and 
the weak equivalence principle with free macroscopic 
masses, initial condition errors are a well known limi¬ 
tation El 0 0. Macroscopic masses have Avogadro’s 
number on their side, while cold-atom drop tests are lim¬ 
ited by Heisenberg’s principle because of the vanishingly 
small mass of the atom clouds. 

The proposed STE-QUEST space experiment [lHl E2L 
[21] investigated by the European Space Agency needs 
3 years to complete one single measurement of the uni¬ 
versality of free fall to 2 ■ 10“ 15 within a total mission 







duration of 5 years. It is known that such a very long 
integration time is set by the need to reduce the random 
shot noise. 

Here we have investigated the effects of initial condi¬ 
tion errors in the presence of the Earth’s gravity gradient, 
and in particular the systematic differential acceleration 
due to (systematic) offset errors at release between clouds 
of different isotopes/species. We have shown that: 

(i) the requirements © on position and velocity off¬ 
set errors at release for this systematic differential ac¬ 
celeration not to exceed the shot noise lim it are -for the 
goal of STE-QUEST- a factor \/l.l • 10 1 2 3 ~ 33 below the 
limit set by Heiseberg’s principle; 

(ii) the systematic offset errors (JtJ) must meet the re¬ 
quirements in all drops, and this must be ensured by 
measuring them, which requires -because of Heisenberg’s 
principle limit for each cloud- the same integration time 
needed to reduce the random shot noise; 

(iii) the systematic nature of the effect caused by off¬ 
set errors at release demands -for the experiment to be 
reliable and its result to be acceptable- that more mea¬ 
surements are performed until it is possible to distinguish 
this effect from the target violation signal. 

The integration time set by the uncertainty principle 
can be reduced only by increasing the number of atoms 
in the clouds (as long as this is done without increasing 
their position and velocity errors), which would as well 
reduce the shot noise. 

The Achille’s heel of light pulse atom interferometers 
in testing the universality of free fall to 2 • 10” 15 and 
better by dropping atoms appears to be the extremely 
small mass of the atom clouds. 

Were STE-QUEST aiming at a 10 times less ambitious 
goal, to 2 • 10 ” 14 , it would not hit the fundamental lim¬ 
its outlined here. In this case the challenges would be 
mostly technical, in order to bridge the 5 million gap 
which separates this goal from the current 10 ”' level of 
UFF/WEP drop tests with cold atoms. 

When aiming at 2 • 10 ” 15 -like STE-QUEST in its cur¬ 
rent design- or better, ways may be pursued to overcome, 
or at least to alleviate, the limitations pointed out in this 
work. 


Squeezing techniques can be investigated, which would 
reduce the effect of initial condition errors, thus allowing 
the corresponding requirements to be relaxed. 

The very large number of drops needed seems in¬ 
evitable; however, one might optimize the time needed 
to perform them both in space and on the ground. 

Partial compensation of the Earth’s gradient by means 
of an appropriate artificial mass nearby seems unrealistic 
in space but may be attempted on the ground with a 
trade-off between the free fall time (hence the size of the 
experimental chamber) and a reduced gradient (hence 
relaxed requirements on initial condition errors). 

A zero check by dropping the same atoms with some 
clever technique to allow them to be distinguished as sug¬ 
gested by can be investigated on the ground. If well 
designed, no appreciable violation should occur and the 
experiment would reliably establish the limiting value of 
77 that a cold-atonr drop test of UFF can achieve. 

Nonetheless, a UFF test to a few 10 ” 15 by dropping 
atoms appears to be hard. Cold-atom tests which are not 
based on mass dropping approach (such as [ 22 ]) should 
not be affected by initial condition errors and maybe 
worth closer attention by the community. 

As at present, a comparison with space experiments 
using macroscopic masses and not based on the mass 
dropping approach shows that Microscope [35] (to be 
launched in April 2016) can make one measurement to 
10 ” 15 in 1.4 d while GG [33] requires a few hours to 
reach 10 ” 17 ; the limitation being thermal noise at room 
temperature in both cases but in different frequency re¬ 
gions of the signal due to different up-convertion rates by 
means of rotation [3B, 39. 
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